Deep learning-based noise robust flexible piezoelectric acoustic sensors for speech processing

نویسندگان

چکیده

Flexible piezoelectric acoustic sensors (f-PAS) have attracted significant attention as a promising component for voice user interfaces (VUI) in the era of artificial intelligence things (AIoT). The signal distortion issue highly sensitive biomimetic f-PAS is one most challenging obstacle real-life application, due to fundamental difference compared with conventional microphones. Here, noise-robust flexible sensor (NPAS) demonstrated by designing multi-resonant bands outside noise dominant frequency range. Broad coverage up 8 kHz achieved adopting an advanced membrane (Nb-doped PZT; PNZT) optimized polymer ratio. Deep learning-based speech processing multi-channel NPAS show outstanding improvement speaker recognition and enhancement commercial microphone. Finally, filtered crowd condition noises, showing independent speaker’s speeches can be identified digitalized simultaneously. To fabricate (NPAS), are designed range, via resonance mechanism. material dimensional effect analysis. • We was Nb-doped PZT membrane. our showed

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Flexible, Robust, and Efficient Human Speech Processing

Present-day speech technology systems try to perform equally well or preferably even better than humans under specific conditions. For more complex tasks machines frequently show degraded performance, because their flexibility, robustness and efficiency is lower than that of humans. In order to better understand the system limitations and perhaps further improve system performance, one can try ...

متن کامل

A Statistical Model-Based Speech Enhancement Using Acoustic Noise Classification for Robust Speech Communication

In this paper, we present a speech enhancement technique based on the ambient noise classification that incorporates the Gaussian mixture model (GMM). The principal parameters of the statistical modelbased speech enhancement algorithm such as the weighting parameter in the decision-directed (DD) method and the long-term smoothing parameter of the noise estimation, are set according to the class...

متن کامل

Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search

Example-based speech enhancement is a promising singlechannel approach for coping with highly nonstationary noise. Given a noisy speech input, it first searches in a noisy speech corpus for the noisy speech examples that best match the input. Then, it concatenates the clean speech examples that are paired with the matched noisy examples to obtain an estimate of the underlying clean speech compo...

متن کامل

Factored Deep Convolutional Neural Networks for Noise Robust Speech Recognition

In this paper, we present a framework of a factored deep convolutional neural network (CNN) learning for noise robust automatic speech recognition (ASR). Deep CNN architecture, which has attracted great attention in various research areas, has also been successfully applied to ASR. However, to ensure noise robustness, since merely introducing deep CNN architecture into the acoustic modeling of ...

متن کامل

Robust Features in Deep-Learning-Based Speech Recognition

Recent progress in deep learning has revolutionized speech recognition research, with Deep Neural Networks (DNNs) becoming the new state of the art for acoustic modeling. DNNs offer significantly lower speech recognition error rates compared to those provided by the previously used Gaussian Mixture Models (GMMs). Unfortunately, DNNs are data sensitive, and unseen data conditions can deteriorate...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Nano Energy

سال: 2022

ISSN: ['2211-3282', '2211-2855']

DOI: https://doi.org/10.1016/j.nanoen.2022.107610